SEQUENCE LISTING 



(1)^ GENERAL INFORMATION: 

(i) APPLICANT: Israel, David 

Wolf man, Neil M 

(ii) TITLE OF INVENTION: RECOMBINANT BONE MORPHOGENETIC PROTEIN 
HETERODIMERS , COMPOSITIONS AND METHODS OF USE 

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Legal Affairs, Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02140-2387 



(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/864,692 

(B) FILING DATE: 07-APR-1992 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32,245 

(C) REFERENCE/DOCKET NUMBER: GI-5192A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617 876-1170 

(B) TELEFAX: 617 876-5851 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 356.. 1543 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



GT.CGACTCTA GAGTGTGTGT CAG^^TTGG CTGGGGACTT CTTGAACT'^CAGGGAGAAT 60 

AACTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT 12 0 

CCCAGCCCCA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC 180 

GTGAAAAGAG AGACTGCGCG GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG 240 

GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA 3 00 

ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG 358 

Met 
1 

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 406 
Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val Leu 
5 10 15 

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 454 
Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe 
20 25 30 

GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502 
Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu Val 
35 40 45 

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 550 
Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gin 
50 55 60 65 

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 598 
Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp 

70 75 80 

CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 646 
Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp His 
85 90 95 

CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 694 
Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe His 
100 105 110" 

CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 742 
His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr 
115 120 125 

CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790 
Arg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe lie 
130 135 140 145 

ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838 
Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala Leu 

150 155 160 

GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 886 
Gly Asn Asn Ser Ser Phe His His Arg He Asn He Tyr Glu He He 
165 170 175 



AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 
Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp 



934 



• 180 W185 190 

ACC AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA ACT TTT GAT GTC 982 
Thr^Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Thr Phe Asp Val 
195 200 205 

ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 1030 
Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His Gly 
210 215 220 225 

TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078 
Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser Lys 

230 235 240 

AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126 
Arg His Val Arg lie Ser Arg Ser Leu His Gin Asp Glu His Ser Trp 
245 250 255 

TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174 
Ser Gin lie Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly 
260 265 270 

CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 1222 
His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His Lys Gin Arg 
275 280 285 

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 1270 
Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe 
290 295 300 305 

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG. GCT CCC CCG GGG TAT CAC 1318 
Ser Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr His 

310 315 320 

GCC TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 1366 
Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu 
325 330 335 

AAC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC 1414 
Asn Ser Thr Asn His Ala lie Val Gin Thr Leu Val Asn Ser Val Asn 
340 345 350 

TCT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 1462 
Ser Lys lie Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala lie 
355 360 365 

TCG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510 
Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr 
370 375 380 385 

CAG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1563 
Gin Asp Met Val Val Glu Gly Cys Gly Cys Arg 

390 395 

CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAA 1607 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 



< 



(A) LENGTH: 3^r amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
15 10 15 

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
20 25 30 

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 
35 40 45 

Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 
50 55 " 60 

Gin Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 
65 70 75 80 

Asp Leu Tyr Arg Arg His Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 

85 90 95 

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
100 105 110 

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
115 120 125 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe 
130 135 140 

He Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala 
145 150 155 160 

Leu Gly Asn Asn Ser Ser Phe His His Arg He Asn He Tyr Glu He 

165 170 175 

He Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 
180 185 190 

Asp Thr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Thr Phe Asp 
195 200 205 

Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 
210 215 220 

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
225 230 235 240 

Lys Arg His Val Arg He Ser Arg Ser Leu His Gin Asp Glu His Ser 

245 250 255 

Trp Ser Gin He Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 
260 265 " 270 

Gly His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His Lys Gin 



•• 275 W280 285 

Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp 
290 295 300 

Phe Ser Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr 
305 . 310 315 320 

His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His 

325 330 335 

Leu Asn Ser Thr Asn His Ala lie Val Gin Thr Leu Val Asn Ser Val 
340 345 350 

Asn Ser Lys lie Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala 
355 360 365 

lie Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn 
370 375 380 

Tyr Gin Asp Met Val Val Glu Gly Cys Gly Cys Arg 
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 403.. 1626 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 60 

GGTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 120 

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 180 

ACAGTCCCCG GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 240 

CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 300 

GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 360 

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 414 

Met lie Pro Gly 
1 

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 462 
Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly 
5 10 15 20 



GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510 
Ala Ser His Ala Ser Leu lie Pro Glu Thr Gly Lys Lys Lys Val Ala 

25 30 35 

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558 
Glu lie Gin Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu 
40 45 50 

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606 
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met Phe Gly Leu Arg 
55 60 65 

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654 
Arg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro Asp Tyr Met Arg 
70 75 80 

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702 
Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu Glu Gin He His 
85 90 95 100 

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750 
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr 

105 110 115 

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798 
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn He Pro Gly Thr 
120 125 130 

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846 
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser He Pro 
135 140 145 

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 894 
Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin 
150 155 160 

GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942 
Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg He Asn He 
165 170 175 180 

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990 
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu He 

185 190 195 

ACA CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038 
Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp 
200 205 210 

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086 
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys 
215 220 225 

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134 
Gin Pro Asn Tyr Gly Leu Ala He Glu Val Thr His Leu His Gin Thr 
230 235 240 

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182 
Arg Thr His Gin Gly Gin His Val Arg He Ser Arg Ser Leu Pro Gin 
24 5 250 255 260 



GGG . AGT GGG AAT TGG GCC 

Gly Ser Gly Asn Trp Ala 

265 

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278 

His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg 
280 285 290 

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 132 6 

Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys 
295 300 305 

CGG GGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 1374 
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp 
310 315 320 

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 1422 
Trp He Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp 
325 330 335 340 

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 1470 
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala He 

345 350 355 

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518 
Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys Ala Cys 
360 365 370 

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566 
Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp Glu 
375 380 385 

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614 
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu Gly 
390 395 400 

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666 
Cys Gly Cys Arg 
405 

CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 
ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 
ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 
TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906 
AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954 



CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 
Gin Leu Arg Pro Leu Leu Val Thr Phe Gly 
270 275 



1230 



1726 
1786 
1846 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: '408 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met lie Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val 
1 5 10 15 

Leu -Leu Gly Gly Ala Ser His Ala Ser Leu lie Pro Glu Thr Gly Lys 
20 25 30 

Lys Lys Val Ala Glu lie Gin Gly His Ala Gly Gly Arg Arg Ser Gly 
35 40 45 

Gin Ser His Glu Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met 
'50 55 60 

Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro 
65 70 75 80 

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu 

85 90 95 

Glu Gin lie His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser 
100 105 110 

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn 
115 120 125 

lie Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu 
130 135 140 

Ser Ser He Pro Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu 
145 150 155 160 

Phe Arg Glu Gin Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His 

165 170 175 

Arg He Asn He Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro 
180 185 190 

Gly His Leu He Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn 
195 200 205 

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp 
210 215 220 

Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala He Glu Val Thr His 
225 230 235 240 

Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg He Ser Arg 

245 250 255 

Ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu 
260 265 270 

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg 
275 280 285 

Arg Ala Lys Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys 
290 295 300 



Ash Lys' - Asn Cys Arg Arg Hi^feer Leu Tyr Val Asp Phe sl^Asp Val 

305 ■ 310 315 320 

Gly, Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr 

325 330 335 

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr 
340 345 350 

Asn His Ala lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser lie 
355 360 365 

Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala lie Ser Met Leu 
370 375 380 

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 
385 390 395 400 

Val Val Glu Gly Cys Gly Cys Arg 

405 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 97.. 1389 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG TGCGGGCCCG 60 

GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG CAC GTG CGC TCA CTG 114 

Met His Val Arg Ser Leu 
1 5 

CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA CCC CTG TTC 162 
Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro Leu Phe 
10 15 20 

CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC GAG GTG CAC 210 
Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu Val His 
25 30 35 

TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG GAG ATG 258 
Ser Ser Phe lie His Arg Arg Leu Arg Ser Gin Glu Arg Arg Glu Met 
40 45 50 

CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC CCG CGC CCG 306 
Gin Arg Glu lie Leu Ser lie Leu Gly Leu Pro His Arg Pro Arg Pro 
55 60 65 70 



CAC 
His 


CTC 
Leu 


CAG 
Gin 


GGC 
Gly 


AAG 
Lys 
75 


CAC 
His 


AAC 
Asn 


TCG 
Ser 


GCA 
Ala 


CCC 
Pro 
80 


ATG 
Met 


TTC 
Phe 


ATG 
Met 


CTG 
Leu 


GAC 
Asp 
85 


CTG 
Leu 


354 


TAC 
Tyr 


AAC 
. Asn 


GCC 
Ala 


ATG 
Met 
90 


GCG 
Ala 


GTG 
Val 


GAG 
Glu 


GAG 
Glu 


GGC 
Gly 
95 


GGC 
Gly 


GGG 
Gly 


CCC 
Pro 


GGC 
Gly 


GGC CAG 
Gly Gin 
100 


GGC 
Gly 


402 


TTC 
Phe 


TCC 
Ser 


TAC 
Tyr 
105 


CCC 
Pro 


TAC 
Tyr 


AAG 
Lys 


GCC 
Ala 


GTC 
Val 
110 


TTC 
Phe 


AGT 
Ser 


ACC 
Thr 


CAG 
Gin 


GGC 
Gly 
115 


Pro 


Pro 


CTG 
Leu 


450 


GCC 
Ala 


AGC 
Ser 
120 


CTG 
Leu 


CAA 
Gin 


GAT 
Asp 


AGC 
Ser 


CAT 
His 
125 


TTC 
Phe 


CTC 
Leu 


ACC 
Thr 


GAC 
Asp 


GCC 
Ala 
130 


GAC 
Asp 


ATCS 

Met 


GTC 

Val 


ATG 
Met 


498 


AGC 
Ser 
135 


TTC 
Phe 


GTC 
Val 


AAC 
Asn 


CTC 
Leu 


GTG 
Val 
140 


GAA 
Glu 


CAT 
His 


GAC 
Asp 


AAG 
Lys 


GAA 
Glu 
145 


TTC 
Phe 


TTC 
Phe 


His 


CCA 

Pro 


CGC 
Arg 
150 


546 


TAC 
Tyr 


CAC 
His 


CAT 
His 


CGA 
Arg 


GAG 
Glu 
155 


TTC 
Phe 


CGG 
Arg 


TTT 
Phe 


GAT 
Asp 


CTT 
Leu 
160 


TCC 
Ser 


AAG 
Lys 


ATC 
He 


CCA 

Pro 


GAA 
Glu 
165 


GGG 
Gly 


594 


GAA 
Glu 


GCT 
Ala 


GTC 
Val 


ACG 
Thr 
170 


GCA 
Ala 


GCC 
Ala 


GAA 
Glu 


TTC 
Phe 


CGG 
Arg 
175 


ATC 
He 


TAC 
Tyr 


AAG 
Lys 


GAC 
Asp 


TAC 

Tyr 
180 


ATC 

He 


CGG 
Arg 


642 


GAA 
Glu 


CGC 
Arg 


TTC 
Phe 
185 


GAC 
Asp 


AAT 
Asn 


GAG 
Glu 


ACG 
Thr 


TTC 
Phe 
190 


CGG 
Arg 


ATC 
He 


AGC 
Ser 


GTT 
Val 


TAT 
Tyr 
195 


Gin 


PTP 

Val 


CTC 
Leu 


690 


CAG 
Gin 


GAG 
Glu 
200 


CAC 
His 


TTG 
Leu 


GGC 
Gly 


AGG 
Arg 


GAA 
Glu 
205 


TCG 
Ser 


GAT 
Asp 


CTC 
Leu 


TTC 
Phe 


CTG 
Leu 
210 


CTC 
Leu 


CAP 
Asp 


A(5P 

Ser 


CGT 
Arg 


738 


ACC 

Thr 
215 


CTC 
Leu 


TGG 
Trp 


GCC 
Ala 


TCG 
Ser 


GAG 
Glu 
220 


GAG 
Glu 


GGC 
Gly 


TGG 
Trp 


CTG 
Leu 


GTG 
Val 
225 


TTT 
Phe 


GAC 
Asp 


ATP 

He 


APA 

Thr 


GCC 
Ala 
230 


786 


ACC 

Thr 


AGC 
Ser 


AAC 
Asn 


CAC 
His 


TGG 
Trp 
235 


GTG 
Val 


GTC 
Val 


AAT 
Asn 


CCG 
Pro 


CGG 
Arg 
240 


CAC 
His 


AAC 
Asn 


CTG 
Leu 


Gly 


P m P 

Leu 
245 


CAG 
Gin 


834 


CTC 
Leu 


TCG 
Ser 


GTG 
Val 


GAG 
Glu 
250 


ACG 
Thr 


CTG 
Leu 


GAT 
Asp 


GGG 
Gly 


CAG 
Gin 
255 


AGC 
Ser 


ATC 
He 


AAC 
Asn 


CCC 
Pro 


A 7AP 

Lys 
260 


mm /-i 
Ilk] 

Leu 


GCG 
Ala 


882 


GGC 
Gly 


CTG 
Leu 


ATT 
He 
265 


GGG CGG 
Gly Arg 


CAC 
His 


GGG 
Gly 


CCC 
Pro 
270 


CAG 
Gin 


AAC 
Asn 


AAG 
Lys 


CAG 
Gin 


CCC 
Pro 
275 


TTC 
Phe 


ATG 
Met 


GTG 
Val 


930 


GCT 
Ala 


TTC 
Phe 
280 


TTC 
Phe 


AAG 
Lys 


GCC 
Ala 


ACG 
Thr 


GAG 
Glu 
285 


GTC 
Val 


CAC 
His 


TTC 
Phe 


CGC 
Arg 


AGC 
Ser 
290 


ATC 
He 


CGG 
Arg 


TCC 
Ser 


ACG 
Thr 


978 


GGG 
Gly 
295 


AGC 
Ser 


AAA 
Lys 


CAG 
Gin 


CGC 
Arg 


AGC 
Ser 
300 


CAG 
Gin 


AAC 
Asn 


CGC 
Arg 


TCC 
Ser 


AAG 
Lys 
305 


ACG 
Thr 


CCC 
Pro 


AAG 
Lys 


AAC 
Asn 


CAG 
Gin 
310 


1026 



GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG 1074 
Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gin 

315 320 325 

AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC CGA GAC CTG 1122 
Arg. Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu 
330 335 340 

GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC GCC TAC TAC 1170 
Gly Trp Gin Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr 
345 350 355 

TGT dAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC ACC 1218 
Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr 
360 365 370 

AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC CCG GAA ACG 1266 
Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn Pro Glu Thr 
375 380 385 390 

GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC ATC TCC GTC 1314 
Val Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala lie Ser Val 

395 400 405 

CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA TAC AGA AAC 1362 
Leu Tyr Phe Asp Asp Ser Ser Asn Val lie Leu Lys Lys Tyr Arg Asn 
410 415 420 

ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC GAGAATTCAG 1409 
Met Val Val Arg Ala Cys Gly Cys His 
425 430 

ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTC 1448 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 
1 5 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 
20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe He His Arg Arg Leu Arg Ser 
35 40 45 

Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser He Leu Gly Leu 
50 55 60 



Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 



65 



70 



75 



80 



Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 

85 90 95 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 110 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
130 135 140 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 160 

Ser Lys He Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He 

165 170 175 

Tyr Lys Asp Tyr He Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg He 
180 185 190 



Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 200 205 

Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 
210 215 220 

Val Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg 
225 230 235 240 

His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 

2 45 250 255 

He Asn Pro Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn 
260 265 270 

Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 
275 280 285 

Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 
2 90 295 300 

Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 
305 310 315 320 

Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 

325 330 335 

Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu 
340 345 350 

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 

Ser Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His 
370 375 380 

Phe He Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 



385 390 V 395 W 400 

Leu Asn Ala lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val lie 
j 405 410 415 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 7: 

'(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: CDNA to ItlRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: Human placenta 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Stratagene catalog #936203 Human placenta 

cDNA library 

(B) CLONE: BMP6C35 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 160.. 1701 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1282.. 1698 

(ix) FEATURE: 

(A) NAME/ KEY: mRNA 

(B) LOCATION: 1..2923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC GAGAGGTGGC 60 

GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG GCCTCGCTCC GCCGCTCCAC 120 

GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC CGGGCGGGG ATG CCG GGG CTG GGG 174 

Met Pro Gly Leu Gly 
-374 -370 

CGG AGG GCG CAG TGG CTG TGC TGG TGG TGG GGG CTG CTG TGC AGC TGC 222 
Arg Arg Ala Gin Trp Leu Cys Trp Trp Trp Gly Leu Leu Cys Ser Cys 

-365 -360 -355 

TGC GGG CCC CCG CCG CTG CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC 270 



Cys Gly Pro Pro Pro Leu A^JPro Pro Leu Pro Ala Ala AH Ala Ala 
-350 -345 -340 

GCC GCC GGG GGG CAG CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG 
Ala Ala Gly Gly Gin Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr 
-335 -330 -325 

GAG CAG CCG CCG CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG 
Glu Gin Pro Pro Pro Ser Pro Gin Ser Ser Ser Gly Phe Leu Tyr Arg 
-320 -315 -310 

CGG CTC AAG ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG 
Arg Leu Lys Thr Gin Glu Lys Arg Glu Met Gin Lys Glu lie Leu Ser 
-305 ; -300 -295 -290 

GTG CTG GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG 
Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gin Gin 

-285 -280 -275 

CCG CAG CCC CCG GCG CTC CGG CAG CAG GAG GAG CAG CAG CAG CAG CAG 
Pro Gin Pro Pro Ala Leu Arg Gin Gin Glu Glu Gin Gin Gin Gin Gin 
-270 -265 -260 

CAG CTG CCT CGC GGA GAG CCC CCT CCC GGG CGA CTG AAG TCC GCG CCC 
Gin Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg Leu Lys Ser Ala Pro 
-255 -250 -245 

CTC TTC ATG CTG GAT CTG TAC AAC GCC CTG TCC GCC GAC AAC GAC GAG 
Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser Ala Asp Asn Asp Glu 
-240 -235 -230 

GAC GGG GCG TCG GAG GGG GAG AGG CAG CAG TCC TGG CCC CAC GAA GCA 
Asp Gly Ala Ser Glu Gly Glu Arg Gin Gin Ser Trp Pro His Glu Ala 
"225 -220 -215 -210 

GCC AGC TCG TCC CAG CGT CGG CAG CCG CCC CCG GGC GCC GCG CAC CCG 
Ala Ser Ser Ser Gin Arg Arg Gin Pro Pro Pro Gly Ala Ala His Pro 

-205 -200 -195 

CTC AAC CGC AAG AGC CTT CTG GCC CCC GGA TCT GGC AGC GGC GGC GCG 
Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser Gly Ser Gly Gly Ala 
-190 -185 -180 

TCC CCA CTG ACC AGC GCG CAG GAC AGC GCC TTC CTC AAC GAC GCG GAC 
Ser Pro Leu Thr Ser Ala Gin Asp Ser Ala Phe Leu Asn Asp Ala Asp 
-175 -170 -165 

ATG GTC ATG AGC TTT GTG AAC CTG GTG GAG TAC GAC AAG GAG TTC TCC 
Met Val Met Ser Phe Val Asn Leu Val Glu Tyr Asp Lys Glu Phe Ser 
-160 -155 -150 

CCT CGT CAG CGA CAC CAC AAA GAG TTC AAG TTC AAC TTA TCC CAG ATT 
Pro Arg Gin Arg His His Lys Glu Phe Lys Phe Asn Leu Ser Gin He 
~ 145 -140 -135 -130 

CCT GAG GGT GAG GTG GTG ACG GCT GCA GAA TTC CGC ATC TAC AAG GAC 
Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp 

-125 -120 -115 



TGT GTT ATG GGG AGT TTT AAA AAC CAA ACT TTT CTT ATC AGC ATT TAT 



Cys Val Met Gly Ser Phe Ly^Asn Gin Thr Phe Leu He He Tyr 

-110 -105 -100 

CAA GTC TTA CAG GAG CAT CAG CAC AGA GAC TCT GAC CTG TTT TTG TTG 1038 
Glri Val Leu Gin Glu His Gin His Arg Asp Ser Asp Leu Phe Leu Leu 
-95 -90 -85 

GAC "ACC CGT GTA GTA TGG GCC TCA GAA GAA GGC TGG CTG GAA TTT GAC 1086 
Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp 
-80 -75 -70 

ATC ACG GCC ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG 1134 
He Thr Ala Thr Ser Asn Leu Trp Val Val Thr Pro Gin His Asn Met 
"65 • -60 -55 -50 

GGG CTT CAG CTG AGC GTG GTG ACA AGG GAT GGA GTC CAC GTC CAC CCC 1182 
Gly Leu Gin Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro 

-45 -40 -35 

CGA GCC GCA GGC CTG GTG GGC AGA GAC GGC CCT TAC GAT AAG CAG CCC 1230 
Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gin Pro 
-30 -25 -20 

TTC ATG GTG GCT TTC TTC AAA GTG AGT GAG GTC CAC GTG CGC ACC ACC 1278 
Phe Met Val Ala Phe Phe Lys Val Ser Glu Val His Val Arg Thr Thr 
-15 -10 -5 

AGG TCA GCC TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC TCT ACC 1326 
Arg Ser Ala Ser Ser Arg Arg Arg Gin Gin Ser Arg Asn Arg Ser Thr 
15 10 15 

CAG TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT TAC AAC AGC 1374 
Gin Ser Gin Asp Val Ala Arg Val Ser Ser Ala Ser Asp Tyr Asn Ser 

20 25 30 

AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG TAT GTG AGT TTC 1422 
Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu Tyr Val Ser Phe 
35 40 45 

CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA CCC AAG GGC TAT GCT 1470 
Gin Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Lys Gly Tyr Ala 
50 55 60 

GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC CCA CTC AAC GCA CAC ATG 1518 
Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu Asn Ala His Met 
65 70 75 

AAT GCA ACC AAC CAC GCG ATT GTG CAG ACC TTG GTT CAC CTT ATG AAC 1566 
Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Leu Met Asn 
80 85 90 95 

CCC GAG TAT GTC CCC AAA CCG TGC TGT GCG CCA ACT AAG CTA AAT GCC 1614 
Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala 

100 105 110 

ATC TCG GTT CTT TAC TTT GAT GAC AAC TCC AAT GTC ATT CTG AAA AAA 1662 
He Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn Val He Leu Lys Lys 
115 120 125 

TAC AGG AAT ATG GTT GTA AGA GCT TGT GGA TGC CAC TAACTCGAAA 1708 



Tyr Arg Asn Met Val Val Ai^Ala Cys Gly Cys His 

130 135 140 



CCAGATGCTG 


GGGACACACA 


TTCTGCCTTG 


GATTCCTAGA 


TTACATCTGC 


CTTAAAAAAA 


1768 


CACGGAAGCA 


CAGTTGGAGG 


TGGGACGATG 


AGACTTTGAA 


ACTATCTCAT 


GCCAGTGCCT 


1828 


TATTACCCAG 


GAAGATTTTA 


AAGGACCTCA 


TTAATAATTT 


GCTCACTTGG 


TAAATGACGT 


1888 


GAGTAGTTGT 


TGGTCTGTAG 


CAAGCTGAGT 


TTGGATGTCT 


GTAGCATAAG 


GTCTGGTAAC 


1948 


TGCAGAAACA 


TAACCGTGAA 


GCTCTTCCTA 


CCCTCCTCCC 


CCAAAAACCC 


ACCAAAATTA 


2008 


GTTTTAGCTG 


TAGATCAAGC 


TATTTGGGGT 


GTTTGTTAGT 


AAATAGGGAA 


AATAATCTCA 


2068 


AAGGAGTTAA 


ATGTATTCTT 


GGCTAAAGGA 


TCAGCTGGTT 


CAGTACTGTC 


TATCAAAGGT 


2128 


AGATTTTACA 


GAGAACAGAA 


ATCGGGGAAG 


TGGGGGGAAC 


GCCTCTGTTC 


AGTTCATTCC 


2188 


CAGAAGTCCA 


CAGGACGCAC 


AGCCCAGGCC 


ACAGCCAGGG 


CTCCACGGGG 


CGCCCTTGTC 


2248 


TCAGTCATTG 


CTGTTGTATG 


TTCGTGCTGG 


AGTTTTGTTG 


GTGTGAAAAT 


ACACTTATTT 


2308 


CAGCCAAAAC 


ATACCATTTC 


TACACCTCAA 


TCCTCCATTT 


GCTGTACTCT 


TTGCTAGTAC 


2368 


CAAAAGTAGA 


CTGATTACAC 


TGAGGTGAGG 


CTACAAGGGG 


TGTGTAACCG 


TGTAACACGT 


2428 


GAAGGCAGTG 


CTCACCTCTT 


CTTTACCAGA 


ACGGTTCTTT 


GACCAGCACA 


TTAACTTCTG 


2488 


GACTGCCGGC 


TCTAGTACCT 


TTTCAGTAAA 


GTGGTTCTCT 


GCCTTTTTAC 


TATACAGCAT 


2548 


ACCACGCCAC 


AGGGTTAGAA 


CCAACGAAGA 


AAATAAAATG 


AGGGTGCCCA 


GCTTATAAGA 


2608 


ATGGTGTTAG 


GGGGATGAGC 


ATGCTGTTTA 


TGAACGGAAA 


TCATGATTTC 


CCTGTAGAAA 


2668 


GTGAGGCTCA 


GATTAAATTT 


TAGAATATTT 


TCTAAATGTC 


TTTTTCACAA 


TCATGTGACT 


2728 


GGGAAGGCAA 


TTTCATACTA 


AACTGATTAA 


ATAATACATT 


TATAATCTAC 


AACTGTTTGC 


2788 


ACTTACAGCT 


TTTTTTGTAA 


ATATAAACTA 


TAATTTATTG 


TCTATTTTAT 


ATCTGTTTTG 


2848 


CTGTGGCGTT 


GGGGGGGGGG 


CCGGGCTTTT 


GGGGGGGGGG 


GTTTGTTTGG 


GGGGTGTCGT 


2908 


GGTGTGGGCG 


GGCGG 










2923 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Pro Gly Leu Gly Arg Arg Ala Gin Trp Leu Cys Trp Trp Trp Gly 
-374 -370 -365 -360 



Leu Leu Cys Ser Cys Cys GJ^Pro Pro Pro Leu Arg Pro pW Leu Pro 
-355 -350 -345 

Ala Ala Ala Ala Ala Ala Ala Gly Gly Gin Leu Leu Gly Asp Gly Gly 
-340 -335 -330 

Ser; Pro Gly Arg Thr Glu Gin Pro Pro Pro Ser Pro Gin Ser Ser Ser 
-325 -320 -315 

Gly Phe Leu Tyr Arg Arg Leu Lys Thr Gin Glu Lys Arg Glu Met Gin 
-310 ? -305 -300 -295 

Lys Glu He Leu Ser Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu 

-290 -285 -280 

His Gly Leu Gin Gin Pro Gin Pro Pro Ala Leu Arg Gin Gin Glu Glu 
-275 -270 -265 

Gin Gin Gin Gin Gin Gin Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg 
-260 -255 -250 

Leu Lys Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser 
-245 -240 -235 

Ala Asp Asn Asp Glu Asp Gly Ala Ser Glu Gly Glu Arg Gin Gin Ser 
-230 -225 -220 -215 

Trp Pro His Glu Ala Ala Ser Ser Ser Gin Arg Arg Gin Pro Pro Pro 

-210 -205 -200 

Gly Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser 
-195 -190 -185 

Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gin Asp Ser Ala Phe 
-180 -175 -170 

Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu Tyr 
-165 -160 -155 

Asp Lys Glu Phe Ser Pro Arg Gin Arg His His Lys Glu Phe Lys Phe 
-150 -145 -140 -135 

Asn Leu Ser Gin He Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe 

-130 -125 -120 

Arg He Tyr Lys Asp Cys Val Met Gly Ser Phe Lys Asn Gin Thr Phe 
-115 -110 -105 

Leu He Ser He Tyr Gin Val Leu Gin Glu His Gin His Arg Asp Ser 
-100 -95 -90 

Asp Leu Phe Leu Leu Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly 
-85 -80 -75 



Trp Leu Glu Phe Asp He Thr Ala Thr Ser Asn Leu Trp Val Val Thr 
-70 -65 -60 -55 

Pro Gin His Asn Met Gly Leu Gin Leu Ser Val Val Thr Arg Asp Gly 

-50 -45 -40 



Val His ! 

Tyr, Asp 

His *val 
-5 

Arg Asn 

Ser Asp 

Leu Tyr 

Pro Lys 
60 

Leu Asn 
75 

Val His 
Thr Lys 
Val He 
His 



Val His Pro 
-35 

Lys Gin Pro 
-20 

Arg Thr Thr 



Arg Ser Thr 
15 

Tyr Asn Ser 
30 

Val Ser Phe 
45 

Gly Tyr Ala 



Ala His Met 



Leu Met Asn 
95 

Leu Asn Ala 
110 

Leu Lys Lys 
125 



Arg Alalia 

Phe Met Val 
-15 

Arg Ser Ala 
1 

Gin Ser Gin 

Ser Glu Leu 

Gin Asp Leu 
50 

Ala Asn Tyr 
65 

Asn Ala Thr 
80 

Pro Glu Tyr 
He Ser Val 



Tyr Arg Asn 
130 



Gly Leu Val 
-30 

Ala Phe Phe 



Ser Ser Arg 
5 

Asp Val Ala 
20 

Lys Thr Ala 
35 

Gly Trp Gin 



Cys Asp Gly 

Asn His Ala 
85 

Val Pro Lys 
100 

Leu Tyr Phe 
115 

Met Val Val 



Gly Arg 

Lys Val 
-10 

Arg Arg 

Arg Val 

Cys Arg 

Asp Trp 
55 

Glu Cys 
70 

He Val 
Pro Cys 
Asp Asp 



Arg Ala 
135 



As^Gly Pro 

-25 

Ser Glu Val 



Gin Gin Ser 
10 

Ser Ser Ala 
25 

Lys His Glu 
40 

He He Ala 
Ser Phe Pro 



Gin Thr Leu 
90 

Cys Ala Pro 
105 

Asn Ser Asn 
120 

Cys Gly Cys 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: U2-OS osteosarcoma 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U2-OS human osteosarcoma cDNA library 

(B) CLONE: U2-16 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 



(B) LOCATION: 69W.2063 

(ix) FEATURE: 

(A) NAME/ KEY : mat_peptide 

(B) LOCATION: 1647.. 2060 

. (ix) FEATURE: 

(A) NAME/ KEY : IIlRNA 

(B) LOCATION: 1. .2153 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA GGGATTGAAT 60 

GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC ATTTACTTGA ATAGTACAAC 120 

CTAGAGTATT ATTTTACACT AAGACGACAC AAAAGATGTT AAAGTTATCA CCAAGCTGCC 180 

GGACAGATAT ATATTCCAAC ACCAAGGTGC AGATCAGCAT AGATCTGTGA TTCAGAAATC 240 

AGGATTTGTT TTGGAAAGAG CTCAAGGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 300 

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC AAAGGCCTGA 360 

TT AT CAT AAA TTCATATAGG AATGCATAGG TCATCTGATC AAATAATATT AGCCGTCTTC 420 

TGCTACATCA ATGCAGCAAA AACTCTTAAC AACTGTGGAT AATTGGAAAT CTGAGTTTCA 480 

GCTTTCTTAG AAATAACTAC TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA 540 

TCGGTGAGGA TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 600 

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT TTTTTTTTTT AAGAGGACAA 660 

GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA ATG CAT CTG ACT GTA 713 

Met His Leu Thr Val 
-316-315 

TTT TTA CTT AAG GGT ATT GTG GGT TTC CTC TGG AGC TGC TGG GTT CTA 761 
Phe Leu Leu Lys Gly lie Val Gly Phe Leu Trp Ser Cys Trp Val Leu 
-310 -305 -300 

GTG GGT TAT GCA AAA GGA GGT TTG GGA GAC AAT CAT GTT CAC TCC AGT 809 
Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn His Val His Ser Ser 
-295 -290 -285 -280 

TTT ATT TAT AGA AGA CTA CGG AAC CAC GAA AGA CGG GAA ATA CAA AGG 857 
Phe He Tyr Arg Arg Leu Arg Asn His Glu Arg Arg Glu He Gin Arg 

-275 -270 -265 

GAA ATT CTC TCT ATC TTG GGT TTG CCT CAC AGA CCC AGA CCA TTT TCA 905 
Glu He Leu Ser He Leu Gly Leu Pro His Arg Pro Arg Pro Phe Ser 
-260 -255 -250 

CCT GGA AAA ATG ACC AAT CAA GCG TCC TCT GCA CCT CTC TTT ATG CTG 953 
Pro Gly Lys Met Thr Asn Gin Ala Ser Ser Ala Pro Leu Phe Met Leu 
-245 -240 -235 

GAT CTC TAC AAT GCC GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA 1001 



Asp Leu Tyr Asn Ala Glu Glwksn Pro Glu Glu Ser Glu TJ^ Ser Val 
-230 -225 -220 



AGG GCA TCC TTG GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA 1049 
Arg" Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro 
-215 -210 -205 -200 

GCC TCT CCC AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT 1097 
Ala Ser Pro Asn Gly Tyr Pro Arg Arg lie Gin Leu Ser Arg Thr Thr 

-195 -190 -185 

CCT CTG ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC 1145 
Pro Leu Thr Thr Gin Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn 
a -180 -175 -170 

TTT CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA 1193 
Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu 
-165 -160 -155 

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT CGA 1241 
Arg Asp Lys Asp Phe Ser His Gin Arg Arg His Tyr Lys Glu Phe Arg 
-150 -145 -140 

TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA GCT GAA 1289 
Phe Asp Leu Thr Gin lie Pro His Gly Glu Ala Val Thr Ala Ala Glu 
- 135 -130 -125 -120 

TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA AAT GAA ACA 1337 
Phe Arg lie Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu Asn Glu Thr 

-115 -110 -105 

ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC ACA AAT AGG GAT 1385 
He Lys He Ser He Tyr Gin He He Lys Glu Tyr Thr Asn Arg Asp 
-100 -95 -90 

GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC CAA GCT TTA GAT GTG 1433 
Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala Gin Ala Leu Asp Val 
-85 -80 -75 

GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC AGC AAT CAT TGG GTG ATT 1481 
Gly Trp Leu Val Phe Asp He Thr Val Thr Ser Asn His Trp Val He 
"70 -65 -60 

AAT CCC CAG AAT AAT TTG GGC TTA CAG CTC TGT GCA GAA ACA GGG GAT 1529 
Asn Pro Gin Asn Asn Leu Gly Leu Gin Leu Cys Ala Glu Thr Gly Asp 
-55 -50 -45 -40 

GGA CGC AGT ATC AAC GTA AAA TCT GCT GGT CTT GTG GGA AGA CAG GGA 1577 
Gly Arg Ser He Asn Val Lys Ser Ala Gly Leu Val Gly Arg Gin Gly 

-35 -30 -25 

CCT CAG TCA AAA CAA CCA TTC ATG GTG GCC TTC TTC AAG GCG AGT GAG 1625 
Pro Gin Ser Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Ser Glu 
-20 -15 -10 

GTA CTT CTT CGA TCC GTG AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC 1673 
Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys Arg Lys Asn Gin Asn 
-5 15 

CGC AAT AAA TCC AGC TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT 1721 



Ar-g Asm Lys Ser Ser Ser Hi^Gln Asp Ser Ser Arg Met S^Ser Val 

10 15 20 25 



GGA GAT TAT AAC ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA 1769 
Gly Asp Tyr Asn Thr Ser Glu Gin Lys Gin Ala Cys Lys Lys His Glu 

30 35 40 

CTC *TAT GTG AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA 1817 
Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp lie lie Ala 
45 50 55 

CCA GAA GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA 1865 
Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro 
60 65 70 

CTT AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG 1913 
Leu Asn Ala His Met Asn Ala Thr Asn His Ala lie Val Gin Thr Leu 
75 80 85 

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT CCA 1961 
Val His Leu Met Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala Pro 
90 95 100 105 

ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC TCC AAT 2009 
Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn 

110 115 120 

GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA TGT GGC TGC 2057 
Val lie Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ser Cys Gly Cys 
125 130 135 

CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT TAAGGTTTAT 2110 
His 

GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 2153 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met His Leu Thr Val Phe Leu Leu Lys Gly lie Val Gly Phe Leu Trp 
-316 -315 -310 -305 

Ser Cys Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn 
-300 -295 -290 -285 

His Val His Ser Ser Phe He Tyr Arg Arg Leu Arg Asn His Glu Arg 

-280 -275 -270 

Arg Glu He Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg 
-265 -260 -255 



Pro Arg Pro Phe Ser Pro Gly Lys Met Thr Asn Gin Ala Ser Ser Ala 
-250 -245 -240 

Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu 
-235 -230 -225 

Ser Glu Tyr Ser Val Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala 
-220 -215 -210 -205 

Arg Lys Gly Tyr Pro Ala Ser Pro Asn Gly Tyr Pro Arg Arg lie Gin 

-200 -195 -190 

Leu Ser Arg Thr Thr Pro Leu Thr Thr Gin Ser Pro Pro Leu Ala Ser 
-185 -180 -175 

Leu His Asp Thr Asn Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe 
-170 -165 -160 

Val Asn Leu Val Glu Arg Asp Lys Asp Phe Ser His Gin Arg Arg His 
-155 -150 -145 

Tyr Lys Glu Phe Arg Phe Asp Leu Thr Gin He Pro His Gly Glu Ala 
-140 -135 -130 -125 

Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Arg Ser Asn Asn Arg 

-120 -115 -HO 

Phe Glu Asn Glu Thr He Lys He Ser He Tyr Gin He He Lys Glu 
-105 -100 -95 

Tyr Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala 
-90 -85 -80 

Gin Ala Leu Asp Val Gly Trp Leu Val Phe Asp He Thr Val Thr Ser 
-75 -70 -65 

Asn His Trp Val He Asn Pro Gin Asn Asn Leu Gly Leu Gin Leu Cys 
-60 -55 -50 -45 

Ala Glu Thr Gly Asp Gly Arg Ser He Asn Val Lys Ser Ala Gly Leu 

-40 -35 -30 

Val Gly Arg Gin Gly Pro Gin Ser Lys Gin Pro Phe Met Val Ala Phe 
-25 -20 -15 

Phe Lys Ala Ser Glu Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys 
-10 -5 1 

Arg Lys Asn Gin Asn Arg Asn Lys Ser Ser Ser His Gin Asp Ser Ser 
5 10 15 20 

Arg Met Ser Ser Val Gly Asp Tyr Asn Thr Ser Glu Gin Lys Gin Ala 

25 30 35 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin 
40 45 50 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
55 60 65 



Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
70 75 80 

lie* Val Gin Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
85' 90 95 100 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe 

105 110 115 

Asp Asp Ser Ser Asn Val lie Leu Lys Lys Tyr Arg Asn Met Val Val 
120 125 130 



Arg Ser Cys Gly Cys His 
135 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to itlRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human Heart 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Human heart cDNA library stratagene 

#936208 

(B) CLONE: hH38 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 8.. 850 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 427.. 843 

(ix) FEATURE: 

(A) NAME/ KEY: mRNA 

(B) LOCATION: 1..997 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC 
Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gin He 
-139 -135 -130 



49 



C.CG GCT GGG GAG GCG GTC AC^GCT GCG GAG TTC CGG ATT T^ AAG GTG 97 

Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys Val 
-125 -120 -115 -110 

CCC AGC ATC CAC CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG 145 
Pro Ser lie His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gin 

-105 -100 -95 

GTG GTC CAG GAG CAG TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT 193 
Val Val Gin Glu Gin Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp 
-90 -85 -80 

CTT CAG ACG CTC CGA GCT GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC 241 
Leu Gin Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val 
-75 -70 -65 

ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA 289 
Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly 
-60 -55 -50 

CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC GTG GAT CCT GGC 337 
Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly 
"45 -40 -35 -30 

CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC 385 
Leu Ala Gly Leu Leu Gly Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe 

-25 -20 -15 

GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG 433 
Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro lie Arg Thr Pro Arg 
-10 -5 1 

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG 481 
Ala Val Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu 
5 10 15 

CCG CAG GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC 529 
Pro Gin Ala Asn Arg Leu Pro Gly lie Phe Asp Asp Val His Gly Ser 
20 25 30 35 

CAC GGC CGG CAG GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG 577 
His Gly Arg Gin Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin 

40 45 50 

GAC CTT GGC TGG CTG GAC TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC 625 
Asp Leu Gly Trp Leu Asp Trp Val He Ala Pro Gin Gly Tyr Ser Ala 
55 60 65 

TAT TAC TGT GAG GGG GAG TGC TCC TTC CCG CTG GAC TCC TGC ATG AAC 673 
Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 
70 75 80 

GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA 721 
Ala Thr Asn His Ala He Leu Gin Ser Leu Val His Leu Met Lys Pro 
85 90 95 



AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC 
Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 115 



769 



TCT GTG CTC TAC TAT GAC AlJ^AGC AAC AAC GTC ATC CTG cW AAG CAC 817 

Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 

120 125 130 

CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC 870 
Arg- Asn Met Val Val Lys Ala Cys Gly Cys His 
135 140 

TACTGCAGCC ACCCTTCTCA TCTGGATCGG GCCCTGCAGA GGCAGAAAAC CCTTAAATGC 93 0 

TGTCACAGCT CAAGCAGGAG TGTCAGGGGC CCTCACTCTC GGTGCCTACT TCCTGTCAGG 990 

CTTCTGGGAA TTC 1003 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gin He Pro Ala 
-139 -135 -130 -125 

Gly Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Val Pro Ser 
-120 -115 -110 

He His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gin Val Val 
-105 -100 -95 

Gin Glu Gin Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin 
-90 -85 -80 

Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala 
~ 75 -70 -65 -60 

Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg 

-55 -50 -45 

Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala 
-40 -35 -30 

Gly Leu Leu Gly Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val 
-25 -20 -15 

Thr Phe Phe Arg Ala Ser Pro Ser Pro He Arg Thr Pro Arg Ala Val 
-10 -5 15 

Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin 

10 15 20 

Ala Asn Arg Leu Pro Gly He Phe Asp Asp Val His Gly Ser His Gly 
25 30 35 

Arg Gin Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu 



r 



40 W 45 50 



Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr 
55 60 65 

Cys- Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr 
70 , 75 80 85 

Asn His Ala lie Leu Gin Ser Leu Val His Leu Met Lys Pro Asn Ala 

90 95 100 

Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val 
105 110 115 

4 

Leu Tyr Tyr Asp Ser Ser Asn Asn Val lie Leu Arg Lys His Arg Asn 
120 125 130 

Met Val Val Lys Ala Cys Gly Cys His 
135 140 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pALBP2-781 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2724.. 3071 

(ix) FEATURE: 

(A) NAME/KEY: terminator 

(B) LOCATION: 3150.. 3218 

(ix) FEATURE: 

(A) NAME/ KEY : RBS 

(B) LOCATION: 2222.. 2723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 60 

CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 12 0 

TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 180 

AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 240 

TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 3 00 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 3 60 



TCCTTGAGAG 


TTTTCGCCCC 


• 

GAAGAACGTT 


TTCCAATGAT 


• 

GAGCACTTTT AAAGTTCTGC 


420 


TATGTGGCGC 


GGTATTATCC 


CGTATTGACG 


CCGGGCAAGA 




P^TPrPlTS P 
Lutv.bV.AlAL 


480 


ACTATTCTCA 


GAATGACTTG 


GTTGAGTACT 


CACCAGTCAC 


AP-AAAAppAm 


prTT 1 a PP P 7i T>P 
V- X X ALuuA X G 


540 


GCATGACAGT 


AAGAGAATTA 


TGCAGTGCTG 


CCATAACCAT 


uAu X uA X AAL 


At X GLGGLL.A 


600 


ACTTACTTCT 


GACAACGATC 


GGAGGACCGA 


AGGAGCTAAC 


o rp 1 1 ii 1 ii 1 ii 1 ifTl^* 


LAtAALA X GG 


660 


GGGATCATGT 


AACTCGCCTT 


GATCGTTGGG 


AACCGGAGCT 


unn X v?AA\7L L 


ATlPrl A APP 
A X AL V.AAAV.U 


720 


ACGAGtGTGA 


CACCACGATG 


CCTGTAGCAA 


TGGCAACAAC 


p/ftp-pp-p a a a 

VjX XV7LuLAAA 


prnjvrnm7i APTP 
V.1A11 AAL X G 


780 


GCGAACTACT 


TACTCTAGCT 


TCCCGGCAAC 


AATTAATAGA 


CT(Zn A TP.P. A P. 


flPPP&Ta 7i AP 
uLuuAlAAAu 


840 


TTGCAGGACC 


ACTTCTGCGC 


TCGGCCCTTC 


CGGCTGGCTG 


pprnrn * rnrn/^ orp 
ulllAl lutl 


P A Tl A A A TPTIP 

GAXAAAXLIG 


900 


GAGCCGGTGA 


GCGTGGGTCT 


CGCGGTATCA 


TTGCAGCACT 




PPTIA APPPprn 

GGXAAGL.LL.T 


960 


CCCGTATCGT 


AGTTATCTAC 


ACGACGGGGA 


GTCAGGCAAC 


TATPP ATP A » 
X A X GGA X GAA 


CGAAATAGAC 


1020 


AGATCGCTGA 


GATAGGTGCC 


TCACTGATTA 


AGCATTGGTA 


A^ X X UtuAL 


PA APTtrprpTV prn 
UAAGX X lAti 


1080 


CATATATACT 


TTAGATTGAT 


TTAAAACTTC 


ATTTTTAATT 


T A A A & P.P & TP 


T 1 APPrnp A A P A 
X AGG X GAAGA 


1140 


TCCTTTTTGA 


TAATCTCATG 


ACCAAAATCC 


CTTAACGTGA 


i ii 1 ii i ii 1 in/^i i ii 1 1« 
V3 X X X X 11L 


PA PT1P A p ppm 
LAG 1 GAGGGT 


1200 


CAGACCCCGT 


AGAAAAGATC 


AAAGGATCTT 


CTTGAGATCC 


1 1 II 1 II 1 II 1 II 1 11 1 II 1 IpfTl^* 

XXXXXXXuXvj 


GGCGTAATCT 


1260 


GCTGCTTGCA 


AACAAAAAAA 


CCACCGCTAC 


CAGCGGTGGT 


X X la X X X VjV_(-Ij 


PArnoA A^A/^i^ 

GA 1 CAAGAGC 


1320 


TACCAACTCT 


TTTTCCGAAG 


GTAACTGGCT 


TCAGCAGAGC 


u^AuAln^V^A 


A A rp A prp/^mop 
AAXAGXG1GG 


1380 


TTCTAGTGTA 


GCCGTAGTTA 


GGCCACCACT 


TCAAGAACTC 


Tf2T & f P & CCf* 
lulAuLAtLu 


PPTIA PRTIJIPP 

GGXAGAXAGC 


1440 


TCGCTCTGCT 


AATCCTGTTA 


CCAGTGGCTG 


CTGCCAGTGG 


LuAlAAultu 


TGTCTTACCG 


1500 


GGTTGGACTC 


AAGACGATAG 


TTACCGGATA 


AGGCGCAGCG 


(a X v_(j(j(i(_ 'X 'GA 


ACGGGGGGTT 


1560 


CGTGCACACA 


GCCCAGCTTG 


GAGCGAACGA 


CCTACACCGA 


ACTGAGATAC 


CTACAGCGTG 


1620 


AGCATTGAGA 


AAGCGCCACG 


CTTCCCGAAG 


GGAGAAAGGC 


GGAOAGGTAT 


CCGGTAAGCG 


1680 


GCAGGGTCGG 


AACAGGAGAG 


CGCACGAGGG 


AGCTTCCAGG 


PPPl TV TV OPPO 

GGGAAAv^GUU 


TGGTATCTTT 


1740 


ATAGTCCTGT 


CGGGTTTCGC 


CACCTCTGAC 


TTGAGCGTCG 


AX XXX XGTGA 


TGCTCGTCAG 


1800 


GGGGGCGGAG 


CCTATGGAAA 


AACGCCAGCA 


ACGCGGCCTT 


rnpnrpTi PPPT"T*P 
XXX ALub X X V» 


G X GGG GT ITT 


1860 


GCTGGCCTTT 


TGCTCACATG 


TTCTTTCCTG 


CGTTATCCCC 


X un X ILlulu 


P A H A A PPPrfl A 

GAX AAGGG1 A 


1920 


TTACCGCCTT 


TGAGTGAGCT 


GATACCGCTC 


GCCGCAGCCG 




PPP APPP A prn 
LuLAuGGAGX 


1980 


CAGTGAGCGA 


GGAAGCGGAA 


GAGCGCCCAA 


TACGCAAACC 


GCCTCTCCCC 


GCGCGTTGGC 


2040 


^^^^ TV m^n tv ^n^n •» 

CGATTCATTA 


ATGCAGAATT 


GATCTCTCAC 


CTACCAAACA 


ATGCCCCCCT 


GCAAAAAATA 


2100 


AATTCATATA 


AAAAACATAC 


AGATAACCAT 


CTGCGGTGAT 


AAATTATCTC 


TGGCGGTGTT 


2160 



GACATAAATA 


CCACTGGCGG 


TGATACTGAG 


CACATCAGCA 


GGACGCACTG 


ACCACCATGA 


2220 


AGGTGACGCT 


CTTAAAAATT 


AAGCCCTGAA 


GAAGGGCAGC 


ATTCAAAGCA 


GAAGGCTTTG 


2280 


GGGTGTGTGA 


TACGAAACGA 


AGCATTGGCC 


GTAAGTGCGA 


TTCCGGATTA 


GCTGCCAATG 


2340 


TGCCAATCGC 


GGGGGGTTTT 


CGTTCAGGAC 


TACAACTGCC 


ACACACCACC 


AAAGCTAACT 


2400 


GACAGGAGAA 


TCCAGATGGA 


TGCACAAACA 


CGCCGCCGCG 


AACGTCGCGC 


AGAGAAACAG 


2460 


GCTCAATGGA 


AAGCAGCAAA 


TCCCCTGTTG 


GTTGGGGTAA 


GCGCAAAACC 


AGTTCCGAAA 


2520 


GATTTTTTTA 


ACTATAAACG 


CTGATGGAAG 


CGTTTATGCG 


GAAGAGGTAA 


AGCCCTTCCC 


2580 


GAGTAACAAA 


AAAACAACAG 


CATAAATAAC 


CCCGCTCTTA 


CACATTCCAG 


CCCTGAAAAA 


2640 


GGGCATCAAA 


TTAAACCACA 


CCTATGGTGT 


ATGCATTTAT 


TTGCATACAT 


TCAATCAATT 


2700 


GTTATCTAAG 


GAAATACTTA 


CAT ATG CAA GCT AAA CAT AAA CAA CGT AAA 


2750 



Met Gin Ala Lys His Lys Gin Arg Lys 
1 5 

CGT CTG AAA TCT AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC AGT 2798 
Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser 
10 15 20 25 

GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 2846 
Asp Val Gly Trp Asn Asp Trp He Val Ala Pro Pro Gly Tyr His Ala 

30 35 40 

TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC 2894 
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn 
45 50 55 

TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT 2942 
Ser Thr Asn His Ala He Val Gin Thr Leu Val Asn Ser Val Asn Ser 
60 65 70 

AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG 2990 
Lys He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala He Ser 
75 80 85 

ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG 3038 
Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin 
90 95 100 105 

GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 3088 
Asp Met Val Val Glu Gly Cys Gly Cys Arg 

HO 115 



CATAAATATA 


TATATATATA 


TATATTTTAG 


AAAAAAGAAA 


AAAATCTAGA 


GTCGACCTGC 


3148 


AGTAATCGTA 


CAGGGTAGTA 


CAAATAAAAA 


AGGCACGTCA 


GATGACGTGC 


CTTTTTTCTT 


3208 


GTGAGCAGTA 


AGCTTGGCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


3268 


CGTTACCCAA 


CTTAATCGCC 


TTGCAGCACA 


TCCCCCTTTC 


GCCAGCTGGC 


GTAATAGCGA 


3328 


AGAGGCCCGC 


ACCGATCGCC 


CTTCCCAACA 


GTTGCGCAGC 


CTGAATGGCG 


AATGGCGCCT 


3388 



GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT 3448 
CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC 3508 
GCT6ACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 3568 

ft 

GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGA 3 623 

(2) TNFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gin Ala Lys His Lys Gin Arg Lys Arg Leu Lys Ser Ser Cys Lys 
15 10 15 

Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp 
20 25 30 

He Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys 
35 40 45 

Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala He Val 
50 55 60 

Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys Ala Cys Cys 
65 70 75 80 

Val Pro Thr Glu Leu Ser Ala He Ser Met Leu Tyr Leu Asp Glu Asn 

85 90 95 

Glu Lys Val Val Leu Lys Asn Tyr Gin Asp Met Val Val Glu Gly Cys 
100 105 110 

Gly Cys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CATGGGCAGC TGAG W 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GAGGGTTGTG GGTGTCGCTA GTGAGTCGAC TACAGCAAAT 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GGATGTGGGT GCCGCTGACT CTAGAGTCGA CGGAATTC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
AATTCACCAT GATTCCTGGT AACCGAATGC T 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
GTGGTACTAA GGACCATTGG CTTAC 
(2)^ INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
* (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
CGACCTGCAG CCATGCATCT GACTGTA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 
TGCCTGCAGT TTAATATTAG TGGCAGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
CGACCTGCAG CCACC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: 11 




(ii) MOLECULE TYPE: DNA 



.(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCGACCCACC ATGCCGGGGC TGGGGCGGAG GGCGCAGTGG CTGTGCTGGT GGTGGGGGCT 60 
GTGCPGCAGC TGCTGCGGGC C 81 

(2) Information for seq id no: 24: 

(i) sequence characteristics: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGCAGCAGCT GCACAGCAGC CCCCACCACC AGCACAGCCA CTGCGCCCTC CGCCCCAGCC 
CCGGCATGGT GGG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCGACTGGTT T 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CGAAACCAG ^ 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

c (ii) MOLECULE TYPE: DNA 



A 'xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 
TCGACAGGCT CGCCTGCA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
GTCCGAGCGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
CAGGTCGACC CACCATGCAC GTGCGCTCA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TCTGTCGACC TCGGAGGAGC TAGTGGC 27 
(2)-, INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1794 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(•ii) MOLECULE TYPE: DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: bmp-3 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 21.. 113 6 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AGATCTTGAA AACACCCGGG CCACACACGC CGCGACCTAC AGCTCTTTCT CAGCGTTGGA 60 

GTGGAGACGG CGCCCGCAGC GCCCTGCGCG GGTGAGGTCC GCGCAGCTGC TGGGGAAGAG 12 0 

CCCACCTGTC AGGCTGCGCT GGGTCAGCGC AGCAAGTGGG GCTGGCCGCT ATCTCGCTGC 180 

ACCCGGCCGC GTCCCGGGCT CCGTGCGCCC TCGCCCCAGC TGGTTTGGAG TTCAACCCTC 240 

GGCTCCGCCG CCGGCTCCTT GCGCCTTCGG AGTGTCCCGC AGCGACGCCG GGAGCCGACG 300 

CGCCGCGCGG GTACCTAGCC ATG GCT GGG GCG AGC AGG CTG CTC TTT CTG 350 

Met Ala Gly Ala Ser Arg Leu Leu Phe Leu 
15 10 

TGG CTG GGC TGC TTC TGC GTG AGC CTG GCG CAG GGA GAG AGA CCG AAG 398 
Trp Leu Gly Cys Phe Cys Val Ser Leu Ala Gin Gly Glu Arg Pro Lys 

15 20 25 

CCA CCT TTC CCG GAG CTC CGC AAA GCT GTG CCA GGT GAC CGC ACG GCA 446 
Pro Pro Phe Pro Glu Leu Arg Lys Ala Val Pro Gly Asp Arg Thr Ala 
30 35 40 

GGT GGT GGC CCG GAC TCC GAG CTG CAG CCG CAA GAC AAG GTC TCT GAA 494 
Gly Gly Gly Pro Asp Ser Glu Leu Gin Pro Gin Asp Lys Val Ser Glu 
45 50 55 

CAC ATG CTG CGG CTC TAT GAC AGG TAC AGC ACG GTC CAG GCG GCC CGG 542 
His Met Leu Arg Leu Tyr Asp Arg Tyr Ser Thr Val Gin Ala Ala Arg 
60 65 70 

ACA CCG GGC TCC CTG GAG GGA GGC TCG CAG CCC TGG CGC CCT CGG CTC 590 
Thr Pro Gly Ser Leu Glu Gly Gly Ser Gin Pro Trp Arg Pro Arg Leu 
75 80 85 90 



CTG CGC GAA GGC AAC ACG GTT CGC AGC TTT CGG GCG GCA GCA GCA GAA 



638 



Leu Arg Glu Gly Asn Thr Va^Rrg Ser Phe Arg Ala Ala A]WAla Glu 

95 100 105 

ACT CTT GAA AGA AAA GGA CTG TAT ATC TTC AAT CTG ACA TCG CTA ACC 686 
Thr Leu Glu Arg Lys Gly Leu Tyr lie Phe Asn Leu Thr Ser Leu Thr 
110 115 120 

AAG TCT GAA AAC ATT TTG TCT GCC ACA CTG TAT TTC TGT ATT GGA GAG 734 
Lys Ser Glu Asn lie Leu Ser Ala Thr Leu Tyr Phe Cys lie Gly Glu 
125 130 135 

CTA QGA AAC ATC AGC CTG AGT TGT CCA GTG TCT GGA GGA TGC TCC CAT 782 
Leu Gly Asn lie Ser Leu Ser Cys Pro Val Ser Gly Gly Cys Ser His 
1*40 145 150 

CAT GCT CAG AGG AAA CAC ATT CAG ATT GAT CTT TCT GCA TGG ACC CTC 83 0 

His Ala Gin Arg Lys His He Gin He Asp Leu Ser Ala Trp Thr Leu 
15 5 160 165 170 

AAA TTC AGC AGA AAC CAA AGT CAA CTC CTT GGC CAT CTG TCA GTG GAT 878 
Lys Phe Ser Arg Asn Gin Ser Gin Leu Leu Gly His Leu Ser Val Asp 

175 180 185 

ATG GCC AAA TCT CAT CGA GAT ATT ATG TCC TGG CTG TCT AAA GAT ATC 926 
Met Ala Lys Ser His Arg Asp He Met Ser Trp Leu Ser Lys Asp He 
190 195 200 

ACT CAA TTC TTG AGG AAG GCC AAA GAA AAT GAA GAG TTC CTC ATA GGA 974 
Thr Gin Phe Leu Arg Lys Ala Lys Glu Asn Glu Glu Phe Leu He Gly 
205 210 215 

TTT AAC ATT ACG TCC AAG GGA CGC CAG CTG CCA AAG AGG AGG TTA CCT 1022 
Phe Asn He Thr Ser Lys Gly Arg Gin Leu Pro Lys Arg Arg Leu Pro 
220 225 230 

TTT CCA GAG CCT TAT ATC TTG GTA TAT GCC AAT GAT GCC GCC ATT TCT 1070 
Phe Pro Glu Pro Tyr He Leu Val Tyr Ala Asn Asp Ala Ala He Ser 
235 240 245 250 

GAG CCA GAA AGT GTG GTA TCA AGC TTA CAG GGA CAC CGG AAT TTT CCC 1118 
Glu Pro Glu Ser Val Val Ser Ser Leu Gin Gly His Arg Asn Phe Pro 

255 260 265 

ACT GGA ACT GTT CCC AAA TGGGATAGCC ACATCAGAGC TGCCCTTTCC 1166 
Thr Gly Thr Val Pro Lys 
270 

ATTGAGCGGA GGAAGAAGCG CTCTACTGGG GTCTTGCTGC CTCTGCAGAA CAACGAGCTT 122 6 

CCTGGGGCAG AATACCAGTA TAAAAAGGAT GAGGTGTGGG AGGAGAGAAA GCCTTACAAG 1286 

ACCCTTCAGG CTCAGGCCCC TGAAAAGAGT AAGAATAAAA AGAAACAGAG AAAGGGGCCT 1346 

CATCGGAAGA GCCAGACGCT CCAATTTGAT GAGCAGACCC TGAAAAAGGC AAGGAGAAAG 1406 

CAGTGGATTG AACCTCGGAA TTGCGCCAGG AGATACCTCA AGGTAGACTT TGCAGATATT 1466 

GGCTGGAGTG AATGGATTAT CTCCCCCAAG TCCTTTGATG CCTATTATTG CTCTGGAGCA 1526 



TGCCAGTTCC CCATGCCAAA GTCTTTGAAG CCATCAAATC ATGCTACCAT CCAGAGTATA 1586 



GTGAGAGCTG TGGGGGTCGT TCCTGGGATT CCTGAGCCTT GCTGTGTACC AGAAAAGATG 1646 

TCCTCACTCA GTATTTTATT CTTTGATGAA AATAAGAATG TAGTGCTTAA AGTATACCCT 1706 

AACATGACAG TAGAGTCTTG CGCTTGCAGA TAACCTGGCA AAGAACTCAT TTGAATGCTT 1766 

AATTCAATCT CTAGAGTCGA CGGAATTC 1794 

(2) INFORMATION FOR SEQ ID NO: 32: 

'* (i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Ala Gly Ala Ser Arg Leu Leu Phe Leu Trp Leu Gly Cys Phe Cys 
1 5 io " " 15 

Val Ser Leu Ala Gin Gly Glu Arg Pro Lys Pro Pro Phe Pro Glu Leu 
20 25 30 

Arg Lys Ala Val Pro Gly Asp Arg Thr Ala Gly Gly Gly Pro Asp Ser 
35 40 45 

Glu Leu Gin Pro Gin Asp Lys Val Ser Glu His Met Leu Arg Leu Tyr 
50 55 60 

Asp Arg Tyr Ser Thr Val Gin Ala Ala Arg Thr Pro Gly Ser Leu Glu 
65 70 75 80 

Gly Gly Ser Gin Pro Trp Arg Pro Arg Leu Leu Arg Glu Gly Asn Thr 

85 90 ~ 95 

Val Arg Ser Phe Arg Ala Ala Ala Ala Glu Thr Leu Glu Arg Lys Gly 
100 105 110 

Leu Tyr He Phe Asn Leu Thr Ser Leu Thr Lys Ser Glu Asn He Leu 
115 120 125 

Ser Ala Thr Leu Tyr Phe Cys He Gly Glu Leu Gly Asn He Ser Leu 
130 135 140 

Ser Cys Pro Val Ser Gly Gly Cys Ser His His Ala Gin Arg Lys His 
145 150 155 160 

He Gin He Asp Leu Ser Ala Trp Thr Leu Lys Phe Ser Arg Asn Gin 

165 170 175 

Ser Gin Leu Leu Gly His Leu Ser Val Asp Met Ala Lys Ser His Arg 
180 185 " 190 

Asp He Met Ser Trp Leu. Ser Lys Asp He Thr Gin Phe Leu Arg Lys 
195 200 205 



Ala Lys Glu Asn Glu Glu Ph^^eu lie Gly Phe Asn lie TnWser Lys 

210 215 220 

Gly Arg Gin Leu Pro Lys Arg Arg Leu Pro Phe Pro Glu Pro Tyr lie 
225 230 235 240 

Leu.;.Val Tyr Ala Asn Asp Ala Ala He Ser Glu Pro Glu Ser Val Val 

245 250 255 

Ser Ser Leu Gin Gly His Arg Asn Phe Pro Thr Gly Thr Val Pro Lys 
260 265 270 

(2) Information for seq id no: 33: 

(i) sequence characteristics: 

(A) LENGTH: 449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

ATGTCTCATA ATCGTTCTAA AACTCCAAAA AATCAAGAAG CTCTGCGTAT GGCCAACGTG 60 

GCAGAGAACA GCAGCAGCGA CCAGAGGCAG GCCTGTAAGA AGCACGAGCT GTATGTCAGC 120 

TTCCGAGACC TGGGCTGGCA GGACTGGATC ATCGCGCCTG AAGGCTACGC CGCCTACTAC 180 

TGTGAGGGGG AGTGTGCCTT CCCTCTGAAC TCCTACATGA ACGCCACCAA CCACGCCATC 240 

GTGCAGACGC TGGTCCACTT CATCAACCCG GAAACGGTGC CCAAGCCCTG CTGTGCGCCC 300 

ACGCAGCTCA ATGCCATCTC CGTCCTCTAC TTCGATGACA GCTCCAACGT CATCCTGAAG 3 60 

AAATACAGAA ACATGGTGGT CCGGGCCTGT GGCTGCCACT AGCTCCTCCG AGAATTCAGA 420 

CCCTTTGGGG CCAAGTTTTT CTGGATCCT 449 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CAAGAAGGAG ATATACAT 
(2) INFORMATION FOR SEQ ID NO: 35: 



18 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



ATGCGTAAAC 


AATGGATTGA 


ACCACGTAAC 


TGTGCTCGTC 


GTTATCTGAA 


AGTAGACTTT 


60 


GCAGATATTG 


GCTGGAGTGA 


ATGGATTATC 


TCCCCCAAGT 


CCTTTGATGC 


CTATTATTGC 


120 


TCTGGAGCAT 


GCCAGTTCCC 


CATGCCAAAG 


TCTTTGAAGC 


CATCAAATCA 


TGCTACCATC 


180 


CAGAGTATAG 


TGAGAGCTGT 


GGGGGTCGTT 


CCTGGGATTC 


CTGAGCCTTG 


CTGTGTACCA 


240 


GAAAAGATGT 


CCTCACTCAG 


TATTTTATTC 


TTTGATGAAA 


ATAAGAATGT 


AGTGCTTAAA 


300 


GTATACCCTA 


ACATGACAGT 


AGAGTCTTGC 


GCTTGCAGAT 


AACCTGGCAA 


AGAACTCATT 


360 


TGAATGCTTA 


ATTCAAT 










377 



